Distributed reinforcement learning for self-reconfiguring modular robots

نویسنده

Paulina Varshavskaya

چکیده

In this thesis, we study distributed reinforcement learning in the context of automating the design of decentralized control for groups of cooperating, coupled robots. Specifically, we develop a framework and algorithms for automatically generating distributed controllers for self-reconfiguring modular robots using reinforcement learning. The promise of self-reconfiguring modular robots is that of robustness, adaptability and versatility. Yet most state-of-the-art distributed controllers are laboriously handcrafted and task-specific, due to the inherent complexities of distributed, local-only control. In this thesis, we propose and develop a framework for using reinforcement learning for automatic generation of such controllers. The approach is profitable because reinforcement learning methods search for good behaviors during the lifetime of the learning agent, and are therefore applicable to online adaptation as well as automatic controller design. However, we must overcome the challenges due to the fundamental partial observability inherent in a distributed system such as a selfreconfiguring modular robot. We use a family of policy search methods that we adapt to our distributed problem. The outcome of a local search is always influenced by the search space dimensionality, its starting point, and the amount and quality of available exploration through experience. We undertake a systematic study of the effects that certain robot and task parameters, such as the number of modules, presence of exploration constraints, availability of nearest-neighbor communications, and partial behavioral knowledge from previous experience, have on the speed and reliability of learning through policy search in self-reconfiguring modular robots. In the process, we develop novel algorithmic variations and compact search space representations for learning in our domain, which we test experimentally on a number of tasks. This thesis is an empirical study of reinforcement learning in a simulated latticebased self-reconfiguring modular robot domain. However, our results contribute to the broader understanding of automatic generation of group control and design of distributed reinforcement learning algorithms. Thesis Supervisor: Daniela Rus Title: Professor

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning

Designing distributed controllers for self-reconfiguring modular robots has been consistently challenging. We have developed a reinforcement learning approach which can be used both to automate controller design and to adapt robot behavior on-line. In this paper, we report on our study of reinforcement learning in the domain of self-reconfigurable modular robots: the underlying assumptions, the...

متن کامل

On Scalability Issues in Reinforcement Learning for Self-Reconfiguring Modular Robots

Self-reconfiguring modular robots have been receiving great attention because advances in our field are expected to deliver ultra-adaptable and robust systems. There has been remarkable progress in modular hardware and distributed controllers, e.g., [1]–[4], some of which were designed automatically by genetic algorithms, e.g., [1]. But how can the greatest adaptability be achieved? Our positio...

متن کامل

Distributed Learning for Controlling Modular Robots

What: Recently there has been an important research effort into modular, distributed robotics and in particular, self-reconfiguring robotics [2, 5, 8]. Issues with designing controllers for such systems range from constructing motor control primitives to ensuring cooperation between modules. For simpler tasks, such as locomotion in one direction, hand design is easy. However, as modular robots ...

متن کامل

Efficient Distributed Reinforcement Learning Through Agreement

Distributed robotic systems can benefit from automatic controller design and online adaptation by reinforcement learning (RL), but often suffer from the limitations of partial observability. In this paper, we address the twin problems of limited local experience and locally observed but not necessarily telling reward signals encountered in such systems. We combine direct search in policy space ...

متن کامل

A distributed and morphology-independent strategy for adaptive locomotion in self-reconfigurable modular robots

In this paper, we present a distributed reinforcement learning strategy for morphology-independent lifelong gait learning for modular robots. All modules run identical controllers that locally and independently optimize their action selection based on the robot’s velocity as a global, shared reward signal. We evaluate the strategy experimentally mainly on simulated, but also on physical, modula...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

Distributed reinforcement learning for self-reconfiguring modular robots

نویسنده

چکیده

منابع مشابه

Automated Design of Adaptive Controllers for Modular Robots using Reinforcement Learning

On Scalability Issues in Reinforcement Learning for Self-Reconfiguring Modular Robots

Distributed Learning for Controlling Modular Robots

Efficient Distributed Reinforcement Learning Through Agreement

A distributed and morphology-independent strategy for adaptive locomotion in self-reconfigurable modular robots

عنوان ژورنال:

اشتراک گذاری